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ABSTRACT 

We measure the angular auto-correlation functions, lj(9), of SDSS galaxies selected 
to have photometric redshifts 0.1 < z < 0.4 and absolute r-band magnitudes M r < 
—21.2. We split these galaxies into five overlapping redshift shells of width 0.1 and 
measure ui(8) in each subsample in order to investigate the evolution of SDSS galaxies. 
We find that the bias increases substantially with redshift — much more so than one 
would expect for a passively evolving sample. We use halo-model analysis to determine 
the best-fit halo-occupation-distribution (HOD) for each subsample, and the best-fit 
models allow us to interpret the change in bias physically. In order to properly interpret 
our best-fit HODs, we convert each halo mass to its z — passively evolved bias (b a ), 
enabling a direct comparison of the best-fit HODs at different redshifts. We find that 
the minimum halo b required to host a galaxy decreases as the redshift decreases, 
suggesting that galaxies with M r < —21.2 are forming in halos at the low-mass end 
of the HODs over our redshift range. We use the best-fit HODs to determine the 
change in occupation number divided by the change in mass of halos with constant b a , 
AN/ AM (b a ), and we find a sharp peak at b a ~ 0.9 — corresponding to an average halo 
mass of ~ 1O 12 /i -1 M0. We thus present the following scenario: the bias of galaxies 
with M r < —21.2 decreases as the Universe evolves because these galaxies form in 
halos of mass ~ 10 12 ft- _1 Af Q (independent of redshift), and the bias of these halos 
naturally decreases as the Universe evolves. 

Key words: Galaxies - clustering: formation. 



1 INTRODUCTION 

The angular auto-correlation function of galaxies, u>(9), en- 
codes a wealth of information, about both cosmology and 
the properties of galaxies. Wide-field surveys, such as the 
Sloan Digital Sky Survey (SDSS), allow precise calculations 
of ui{ff) over a range of scales spanning over three orders of 
magnitude — thereby probing both the clustering of galaxies 
dominated by interactions within dark matter halos and also 
the clustering of galaxies that is determined by the matter 
density field. Angular clustering measurements made using 
data from photometric surveys are complicated by the fact 
that such surveys can only easily provide precise information 
on the locations of galaxies in two dimensions, while many 
analyses of interest require knowledge of the three dimen- 
sional distribution. Multi-band surveys, such as SDSS, allow 
estimation of photometric redshifts, and thus with careful 
treatment, one can estimate the radial distribution of galax- 
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ies and split the galaxies by redshift, type, and luminosity. 
As a result, one can investigate the evolution of galaxies with 
photometric data. The techniques to do this are gaining in 
importance, as many of the next generation of wide-field 
surveys will rely primarily on photometric redshifts to gain 
knowledge of their respective radial distributions (e.g., DES, 
PanStarrs, LSST). 

Using the auto-correlation function of galaxies to study 
their properties has been aided in recent years by the devel- 



opment of the 'halo-model' (see, e.g. 


Kauffmann et al.|1997 


|Peacock & Smith|[2000| ICooray & S 


heth||2002| |Zheng et al.| 



|2005[ |Tinker et al.|20 05) as a way of parameterising galaxy 
bias. One can fill dark matter halos with galaxies based on 
a statistical 'halo-occupation-distribution' (HOD), allowing 
one to model the clustering of galaxies within halos (and 
thus non-linear scales) while providing a self consistent de- 
termination of the bias at linear scales. Thus, as shown by, 
e.g., |Zehavi et al.| (|2004|), |Blake et al.| (|2008|), |Tinker et al 



( |2008| ), Ross fc Brunner (2009; hereafter R09) one can use 
measurements of galaxy auto-correlation functions to con- 
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strain the HODs of different sets of galaxies and to gain 
information on the nature in which galaxies occupy dark 
matter halos. 

Many recent studies have used clustering measurements 



to study the evolution of galaxies. Wake et al. ( 2008 1 and 



Brown et al. (2008) measured the auto-correlation functions 



of luminous red galaxies (LRGs) and interpreted the re- 
sults with the halo model to show that their evolution is 
inconsistent with passive evolution, while Tojeiro & Perci- 
vaI| ( |20T0t were able to determine, via luminosity weighted 
power-spectrum measurements, that the non-passive evolu- 
tion is due primarily to lower luminosity LRGs. Zhen g et al.| 
( |2007[ > used auto-correlation function measurements to con- 
strain the HODs of SDSS spectroscopic galaxies (z ~ 0.1) 
and DEEP2 galaxies [z ~ 1), allowing them to investigate 
the evolution of the HOD, stellar mass, and satellite frac- 
tion of galaxies over a relatively large range of luminosities. 
Ross et al. (2007; hereafter R07) studied the clustering of 
SDSS DR5 galaxies split by redshift between z < 0.3 and 
0.3 < 2 < 0.4, and found significantly larger bias at higher 
redshift, especially for late-type galaxies. 

In this paper, we use galaxies photometrically selected 
from the SDSS seventh data release (DR7) to investigate 
the evolution of galaxies between redshifts 0.1 and 0.4. While 
this represents a relatively small range in redshift, our study 
offers a unique combination in that it utilizes over 6000 
square degrees of observing area (after masking) and the 
evolution we study is for galaxies drawn entirely from the 
SDSS (and we thus do not have to worry about selection 
techniques of different surveys or differing calibration is- 
sues). We are thus able to precisely measure the angular 
auto-correlations of SDSS galaxies, use the halo-model to 
interpret them, and self-consistently compare the results at 
different redshifts. 

Our paper is outlined as follows: Sj2] describes the cre- 
ation of our galaxy catalog and its five subsamples, its an- 
gular masking, and our methods for estimating the redshift 
distributions of each subsample; [J3] describes how we mea- 
sure the angular auto-correlation functions of these galaxies 
and how we model the results; fj4] presents the results of 
our auto-correlation function measurements and the best-fit 
HOD for each redshift slice; in §5 we use cross-correlation 
measurements to investigate potential systematics, in S|6]we 
discuss the physical implications of our results; finally, we 
conclude in SjT] Throughout this work, we assume a flat cos- 
mology with f2 m = 0.3, h — 0.7, as ~ 0.8, and ilb ~ 0.05. 



2 DATA 

We use data from the Northern, contiguous area of the 
SDSS seventh data release (DR7). This survey obtains wide- 
field CCD photometry ( Gunn et al.|1 998|) in five passbands 



r,i,z; e.g. 



Fukugita et al.|1996 1. DR7 contains a mod- 
erate increase over the DR5 imaging area (~ 500 square 
degrees), but the precision and accuracy of its photometric 
redshift catalog represents a substantial improvement over 
previous data releases (Abazajian et al. 2009). We select 
galaxies from the DR7 photoz catalog with de-reddened r- 
band magnitudes (r<j) less than 21. Redshifts in this catalog 
were estimated using a hybrid template/empirical approach, 
and the output includes rest-frame absolute magnitudes, k- 




Figure 1. The M r ,z parameter-space in which we find galaxies 
is shaded black. Red dotted lines define the boundaries of our 
selection criteria (M r < —21.2, z < 0.4). 



corrections, and galaxy-type values. We use this information 
to construct an approximately volume-limited sample, using 



the methods outlined in Budavari et al. ( 2003 1. Our resulting 



criteria are that galaxies have photometric redshifts z < 0.4 
and r-band absolute magnitudes, M r < —21.2 (equivalent 
to M r - 51og 10 ft < -20.43). 

Figure [T] displays our galaxy selection criteria in dotted 
redlines, plotted over the shaded black region of M r ,z pa- 
rameter space where galaxies with rd < 21 exist in the SDSS 
DR7 photoz catalog. We make our selection at M r < —21.2, 
rather than the very edge of the shaded region (~ —20.7), in 
order to account for differences in k-corrections between dif- 
ferent galaxy types (which can be as high as 1 at z = 0.4 in 
the r-band). We applied imaging masks and cuts on seeing 
and reddening at l'.'5 and A r = 0.2 (as in R07) to the sur- 
vey area, while cutting out data with flags indicating poor 
photometry/spurious object detection. This left a total of 
3,123,487 galaxies with 0.1 < z < 0.4, occupying 6019 square 
degrees of observed sky. 

We split the sample by redshift into five samples with 
0.1 < z < 0.2, 0.15 < z < 0.25, 0.2 < z < 0.3, 0.25 < 
z < 0.35, and 0.3 < z < 0.4. While splitting the samples in 
this manner means that they are not mutually exclusive, it 
allows for a test of whether the redshift evolution is smooth 
(any sharp transition might imply a systematic in the data). 
Reducing the width of the redshift slices further would not 
provide significantly more information, as the error on the 
photometric redshifts is ~ 0.05 at z ~ 0.3. 



2.1 Estimating True Redshift Distributions 

We take care in estimating the form of each of our redshift 
distributions, as this is quite important to our analysis. We 
treat each individual galaxy's redshift as a Gaussian PDF 
based on its maximum likelihood redshift and associated 
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error, and convolve this PDF with volume and luminosity 
function (LF) constraints. Volume arguments imply that a 
galaxy is more likely to have a larger redshift than a smaller 
one, while LF arguments imply that a galaxy is more likely 
to be faint than bright. Thus, we sample the Gaussian PDF 
to find a redshift we refer to as z and determine the sampled 
absolute magnitude, M', by adding to M r the difference in 
distance modulus between z and z' . In order to apply the 
volume and LF constraints, we weight each sampled redshift 
by 



/»« = (x(z')/x(z)) 2 $(M')/$(M r ), 



(1) 



where x{z) is the comoving distance to redshift z, and 
$(M) is the best-fit Schecter form of the LF determined by 
Montero-Dorta & Pradal (|2009b for r-band SDSS galaxies. 



We sample each galaxy's Gaussian PDF fO times and 
find f n z for each sampling. We normalise such that the sum 
of f nz adds to 1 for each galaxy (in order to insure that 
each galaxy contributes to the overall dN/dz at the same 
level) and then add each of the fO normalised f n z to their 
appropriate bin (we use bins of width Az — 0.001). Thus, 
when completed for galaxies in a given sample, we have an 
estimate for the total number galaxies at z±0.0005 included 
in the sample. Each dN/dz is then normalised by dividing 
each bin by £\ dN /dz(zi)*0. 001 and we interpolate between 
bins to obtain a continuous, normalised, dN/dz. 

Our construction of dN/dz eliminates unphysical results 
- such as non-zero probability at redshift 0. It is simi- 
lar to the treatment presented in section 4.2 of |Budavari| 
|et al.| ( |2003[ ), but we apply the treatment to each galaxy 
rather than bin in magnitude. In general, the resulting red- 
shift distributions are similar to the distributions one gets 
from Gaussian sampling (the LF and volume effects tend to 
cancel each other) but have lower values at the tails of the 
distribution. 

Figure [5] displays the normalised (such that they in- 
tegrate to 1) redshift distributions of our galaxy sample 
with 0.3 < z < 0.4. The solid line displays dN/dz deter- 
mined using Gaussian sampling combined with LF and vol- 
ume considerations, while the dashed line displays the re- 
sult obtained using only Gaussian sampling. For this sam- 
ple, the median redshift is shifted to a slightly lower value, 
and the distribution has lower values at the tails (resulting 
in a stronger peak). The shift in redshift is due to the fact 
that our galaxies have magnitudes close to M*, and the LF 
thus suggests that a decrease in redshift is more likely than 
an increase since the decrease will lower the luminosity. 

Figure [3] displays the normalised redshift distributions 
for each redshift slice (incorporating the LF and volume ef- 
fects, as one can assume we do from here-on). The distri- 
butions get wider as the median redshift increases due to 
the fact that the mean redshift error increases with redshift 
(see Table [T]). Each individual distribution appears roughly 
Gaussian. 

We can estimate the true absolute magnitude distribu- 
tions in a manner that is similar to our estimation of dN/dz. 
We follow the procedure outlined at the beginning of this 
section, but bin in M r instead of redshift. It is important 
to consider the true distribution of M r in each of our sam- 
ples because one may worry that applying the same cut on 
absolute magnitude to samples with differing photometric 
redshift errors could yield significantly different magnitude 




Figure 2. The normalised (such that they integrate to 1) redshift 
distributions for galaxies in our sample with 0.3 < z < 0.4, de- 
termined combining Gaussian sampling with LF and volume con- 
siderations (solid line) and using only Gaussian sampling (dashed 
line) . 



distributions. Figure [4] displays the normalised (such that 
they integrate to 1) M r distributions for each of our samples 
(0.1 < 2 < 0.2, black; 0.15 < z < 0.25, red; 0.2 < z < 0.3, 
blue; 0.25 < z < 0.35, green; and 0.3 < z < 0.4, ma- 
genta). The peak of each of the distributions occurs near 
M r = —21.5 and there are only slight differences between 
the samples. The higher redshift samples display slightly 
broader distributions (due to the fact that they have larger 
photometric redshift errors) and the broadening occurs most 
prominently at the faint end of the distributions. This sug- 
gests, that, if the differences in these distributions cause any 
effect at all, it will be to slightly decrease the bias of the 
higher redshift samples. 



2.2 Estimating True Completeness 

As in Ross & Brunner (2009; hereafter R09), we use the red- 
shift distribution to calculate the weighted number density, 



of observed galaxies using 



dz 



dN 



4nf obs x 2 (z)c dz \ dz I 



H{z) 



I 



,(2) 



where x(z) is the comoving distance to redshift z, f ), s is the 
fraction of observed sky, c is the speed of light, H(z) is the 
rate of expansion, and dN/dz is the un- normalised redshift 
distribution. Equation ([2| determines the number density of 
galaxies that contributes to a clustering measurement, and 
is therefore the best estimate of the observed number density 
of galaxies in a photometric redshift bin. 

For our models, we need to know the true number den- 
sity of galaxies for a given sample, and this is necessar- 
ily larger than the observed number density obtained from 
Equation If we assume our observed galaxies are a ran- 
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z 



Figure 3. The normalised (such that they integrate to 1) redshift 
distributions for each of the five photometric redshift slices 0.1 < 
z < 0.2, 0.15 < z < 0.25, 0.2 < z < 0.3, 0.25 < z < 0.35, and 
0.3 < z < 0.4. 




M r 



Figure 4. The normalised (such that they integrate to 1) absolute 
r-band magnitude (M r ) distributions for each of the five photo- 
metric redshift slices 0.1 < z < 0.2 (black), 0.15 < z < 0.25 (red), 
0.2 < z < 0.3 (blue), 0.25 < z < 0.35 (green), and 0.3 < z < 0.4 
(magenta). 



dom sampling of this complete sample, the incompleteness 
should not affect our ability to model our measurements as 
long as we have a good estimate of the true number den- 
sity. We therefore assume the true number density of galax- 
ies, n g T, does not evolve with redshift and estimate n g r by 
taking the total number of galaxies in the 0.1 < z < 0.2 
sample and dividing it by the co-moving volume given by 
this redshift range. This yields n gT = 0.0049/i 3 Mpc" 3 . We 
can thus compare the number density given by Equation Q 
to 0.0049 /i 3 Mpc -3 in order to estimate the completeness of 
each sample. This will not be a concern for the modelling 
(which will always use n gT = 0.0049h 3 Mpc~ 3 ), but it will 
help us determine the quality of each of our data samples. 

All of our samples will suffer incompleteness, in terms 
of the fraction of galaxies that contribute to the clustering 
signal, due to galaxies whose photometric redshift estimates 
have scattered them out of a particular redshift bin. This 
effect is made clear by considering the redshift distributions 
of Figure [3] Each individual distribution clearly displays a 
significant portion of its curve lying outside bounds of its 
hard cut on photometric redshift (for example, the dN/dz of 
the 0.3 < z < 0.4 slice has significant amplitudes at z < 0.3 
and z > 0.4). Naturally, this effect grows larger as the mean 
redshift error of galaxies increases. 

We display the number, the weighted number density 
(n g ), the median redshift (z), the mean redshift error of 
galaxies (Az), the median r-band absolute magnitude (M r ), 
the completion (i.e. n 9 /0.0049), and the weighted fraction 
of galaxies with type value t > 0.1 (fiate) for each redshift 
slice in Table 1. The M r are calculated from the magnitude 
distributions displayed in Figure |4] As the fi gure suggests, 
the values of M r are extremely similar each other across all 
samples — to the nearest tenth of a magnitude they are all 



equal to -21.6. This suggests that we are comparing galaxies 
of the same luminosity in every sample. 

The sharp decrease in completion for galaxies with z > 
0.2 is coupled with a sharp increase in the mean redshift 
error. For our sample with the highest redshifts (0.3 < z < 
0.4), there is a significant decrease in the completion (from 
0.48 to 0.42), but the increase in mean redshift error is not 
as significant (0.051 to 0.054). This suggests that the data 
may be suffering from incompletion in the parent rd < 21 
sample, due to, e.g., low surface brightness objects (which 
|Blanton etldT| 2005 suggests may be an issue) and also the 
effects of Malmquist bias, since our z < 0.4 limit is imposed 
partly due to our rd < 21 limit. 

We calculate the weighted late-type fraction by finding 
n g date via Equation |2| (using dN/dzi ate ) and dividing the 
result by n g . We split the data at t — 0.1, since this split 
yields similar late-type fractions as the split used by R09. 
The value of fiate is important, as it affects both the overall 
bias and the shape of lj(6). For all of the data with z < 0.35, 
the fiate are consistent to within 10% (and there is no overall 
trend). There is, however, a large decrease in fi ate in the 
highest redshift slice. This suggests that the large decrease in 
completeness in this slice may be tied to a deficit of late-type 
galaxies in our sample. We should therefore be careful when 
comparing measurements from the 0.3 < z < 0.4 slice to the 
lower-redshift ones. Conversely, the agreement between the 
slices with z < 0.35 encourages comparisons between these 
four samples. 
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Table 1. Characteristics of the five SDSS DR7 samples with M r < —21.2 used to calculate angular auto-correlation functions; n g is the 
weighted observed number density, z is the median redshift, Az is the mean redshift error, f c is the completion (n s /0.0049), and fi a t e is 
the weighted fraction of late-type galaxies. 



Redshift range Number of galaxies n g (h 3 /Mpc 3 ) z Az M r f c fiate 



0.1 < z < 0.2 


483,655 


0.00377 


0.16 


0.022 


-21.63 


0.77 


0.38 


0.15 < z < 0.25 


771,681 


0.00369 


0.2 


0.027 


-21.60 


0.75 


0.35 


0.2 < z < 0.3 


1,027,754 


0.00272 


0.25 


0.041 


-21.57 


0.56 


0.35 


0.25 < z < 0.35 


1,406,302 


0.00236 


0.3 


0.051 


-21.56 


0.48 


0.37 


0.3 < z < 0.4 


1,612,078 


0.00204 


0.34 


0.054 


-21.59 


0.42 


0.29 



3 MEASUREMENT AND ANALYSIS TOOLS 
3.1 Measurement Techniques 

We calculate the angular auto-correlation function, lj(0), of 
galaxies using the |Landy fc Szalay| ( |1993[ ) estimator: 



u{9) 



DP (6) - 2DR(6) + RR{6) 
RR{0) : 



(3) 



where PP is the number of galaxy pairs, PR the num- 
ber of galaxy-random pairs, and RR the number of random 
pairs, all separated by an angular distance 6 ± Ad. We will 
also measure angular cross-correlation functions, uj x (9), for 
which the Landy & Szalay (1993) estimator is 



w.(0) 



gigaff) - PiR{6) - £>2.fi(fl) + RR(6) 
RR(6) 



(4) 



where P\ and P2 represent separate data samples. We can 
always employ the same random catalog (which includes 10 
million points) since all of our galaxies have the same angular 
selection. 

We use a jackknife method (e.g., |Scranton et al.|2002 |, 
with inverse-variance weighting to estimate our errors and 
covariance matrix (e.g., [Myers et aL||2007[ ). The method is 
nearly identical to the method described in detail in R07. 
The jackknife method works by creating many subsamples 
of the entire data set, each with a small part of the total 
area removed. R07 found that 20 jack-knife subsamplings are 
sufficient to create a stable covariance matrix, and we find 
similar results for DR7. These 20 subsamples are created by 
extracting a contiguous grouping of 1 /20th of the unmasked 
pixels in 20 separate areas. Our covariance matrix, Cj ac k, is 
thus given by 



jack — Cjack^ijCj'J 



u} k (0i)][wf u u(0j) —uj k {0j)], 



(5) 



where uik(0) is the value for the correlation measurement 
omitting the fcth subsample of data and i and j refer to the 
i th and j th angular bin. The jackknife errors are simply the 
square-root of diagonal elements of the covariance matrix. 
Such a technique should account for statistical errors due to 
variations in both the angular plane and the radial direc- 
tion, as each jackknife represents a different realisation of 
the radial selection. 

|Norberg et al.| ( |2009[ ) have shown that using a jack- 
knife method to estimate covariance matrices does not yield 
perfect results. For projected correlation function measure- 
ments (the case they study most similar to ours) the jack- 
knife method does well at large scales, but over-predicts the 



variance at small scales. For the covariance, again the jack- 
knife method is shown to be imperfect. We do not feel this is 
a major issue for the interpretation of our measurements as 
the conclusions we draw will not depend heavily on the exact 
nature of the covariance matrices. We explore this further 
in section [6721 



3.2 



Transformation to Angular Correlation 
Function 



Our theoretical modeling will produce galaxy-galaxy power 
spectra, P(k, r). Thus, we must Fourier transform the model 
power spectra to a real-space correlation function, £(r), 



1 

2^2 



dk P(k,r)k' 



. sin kr 
kr 



(6) 



where r is the real-space distance and our model P (fc, r) will 
depend on it due to halo-exclusion (see section [3~4| . We use 
Limber's equation ( Limber||1954[ ) to project the real-space 
model to angular space (assuming a flat Universe): 

u{8) = 

2/c J o °° dz H{z)(dN/dz) 2 J °° du £(r = ^/u 2 + x 2 (z)9 2 ), 

where dN/dz is the normalised redshift distribution, and 
x(z) is the comoving distance to the median redshift z. The 
factor l/cj^°dz H(z)(dN/dz) 2 essentially tells one how 
much the radial extent of the galaxy distribution dilutes 
the real-space clustering signal. This will therefore change 
for each redshift sample, and it is thus an important factor 
when comparing results between different redshift slices. We 
therefore define 



W = l/c / dz H(z)(dN/dzf 



(8) 



and we will use this factor W in order to enable direct com- 
parison of our measurements to each other. 

Recent studies (e.g. Padmanabhan et al.|2007 Baldauf 



et al. 2009 Nock et al. 2010 1 have shown that redshift distor- 



tions can significantly affect projected correlation function 
measurements. The importance of the redshift distortion ef- 
fect grows with the effective scale, and at the scales we probe 
(r eq < 15/i _1 Mpc), it would increase our models by at most 
~ 5%, given that the sizes of our radial windows are all 
greater than 250 /i _1 Mpc. This suggests that including the 
effects of redshift distortions would not alter our models sig- 
nificantly enough to alter any of our conclusions. 
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3.3 Passive Evolution 

In order to compare our measurements in different redshift 
shells, we must take into account the evolution of the clus- 
tering of the dark matter. The overall growth of structure 
in the Universe implies that the bias, 6(2)1 °f a passively 
evolving set of galaxies (i.e., no mergers) will tend towards 
unity. Specifically, this is expressed as 



b(z 1 ) = l+(b(z 2 )-l)D(z 2 )/D(z l ), 



(9) 



(see, e.g., Fry 1996, Tegmark & Peebles 1998) where D(z) 
is the linear growth factor. Thus, assuming no evolution in 
the physical properties of individual galaxies, a passively 
evolving set of galaxies with b — 1.40 at 2 = 0.34 should 
have a bias of 1.335 today (2 = 0). Given that = 

( 6(z2)o(z2) ) ' one can ex P ress the ratio of the real-space 
clustering amplitude between two redshifts as 



f t (zx,z 2 ) = 



b{Zi) 



b(z 1 )-l + D(z 2 )/D(z 1 ) 



(10) 



Given Equations (jSp and (10 1, we can account for the 



expected changes in the angular clustering due to both the 
changes in the widths of the redshift distributions and the 
median redshifts. Given the median redshift, 2, one can de- 
termine u)(r eq ) by finding the equivalent physical scale of a 
given angular separation via 



r eq = 2x(z)tan(0/2), 



(11) 



(where again x(z) is the comoving distance to median red- 
shift 2). Thus, the expected difference between uj(r eq ) mea- 
sured at z\ and 22 can be expressed as 

/»(2i,30 = / e WVW' a , (12) 

where W\ is determined via Equation Q for redshift dis- 
tribution with median redshift z\. Any difference between 
(21,22) and "(^"'"^j would thus be due evolutionary ef- 
fects, such as the merging or dimming of galaxies. 



3.4 Halo Modeling 

We use the halo model to produce model galaxy auto- 
correlation functions using techniques similar to those out- 
lined in R09. We assume that the overall galaxy power spec- 
trum can be modeled as having a contribution due to galaxy 
pairs within dark matter halos (the '1-halo' term) and a con- 
tribution due to galaxies pairs in separate halos (the '2-halo' 
term) . The number of galaxies expected to occupy a halo is 
modeled as a function of mass, and the 1 and 2-halo terms 
can be self-consistently determined given this HOD. 



As in previous studies (see, e.g., Zheng et al. 2005 Blake 
et al.|2008 R09), we assume separate mean occupations for 
central galaxies, N C (M) and for satellite galaxies, N S (M). 
Thus, 



N(M) = N C {M) x (1 + N a (M)), 



(13) 



where we are assuming that only halos with central galaxies 
can have satellite galaxies. This allows for two 1-halo com- 
ponents — one for central-satellite pairs, P ca (k), and the 
other for satellite-satellite, P aa (k), pairs. They are given by 
(see, e.g., Appendix B of |Skibba fc Sheth|2009 | 



dMnmN^M) 2 ^^, (14) 



M„ iT .(r) 



-gT 



P ss (fc) = / dMn(M)N c (M) 



(N a (M)u(k\M)Y 



(15) 



where the factor n(m) is the halo number density, for which 
we use the Jenkins et al. (20011 model (and is implicitly 
dependent on redshift), and u(k\M) is the Fourier trans- 



form of the Navarro, Frenk, fc White (19971 (NFW) dark 



matter profile. We have implicitly assumed that the satellite 
galaxies are poisson distributed (as is found to be a good ap- 
proximation by, e.g., Kravtsov et al.|2004 Zheng et al.|2005 1 
allowing the use of N a (M)' z in place of (N s (M)(N a (M) — l)). 
The 2-halo term is given by 



Pih{k,r) 



where P„. 



Pm atter(k) 



(16) 



is the matter power-spectrum determined via 



the fitting formulae of Smith et al.| d2003|) and b(M, r) is the 



scale dependent bias of halos. This bias can be expressed 
( |Tinker et al.|2005[ ) as 

[1 + 1.17UM] 1 - 49 



6 (M,r) = B\M) 



[1 + 0.69UM] 2 - 09 ' 



(17) 



where B(M) is the bias of halos, which we calculate using 



the model of Sheth et al. (20011 with the parameterisation 



determined by Tinker et al. ( 2005 I (and is implicitly de- 



pendent on redshift) and f m (r) is the non-linear real-space 
matter 2-point correlation function, determined by Fourier 
transforming the matter power spectrum. The parameter 
Mu m (r) is the mass limit due to halo-exclusion, which we de- 



termine using the methods described by Tinker et al. ( 2005 1 



and Blake et al. ( 2008 1 



One can calculate the average number density of galax- 
ies for a given HOD, n g H, via 



n gH 



dMn(M)N(M), 



and the restricted number density of galaxies, n' g , via 

cM Um (r) 

&Mn{M)N(M). 



(18) 



(19) 



The full model galaxy-galaxy power spectrum is thus given 
by P(k,r) = P ca (k) + P aa (k) + P 2h (k,r). 

For the central galaxy HOD, we use the same parame- 
terisation as in R09, i.e., 



N C (M) = 0.5 



1 + erf 



logio {M/M cut ) 



&cut 



For the satellite galaxy HOD, we use 



N a (M) 



( M-M cu t y 
V Mi J 



(20) 



(21) 



This i s similar to the parameterisation used by |Zheng et al.| 



(2007), N S (M) = ( M ~^° ) a , but we use M cut instead of 



adding a new M parameter. This is motivated by the fact 
that, in Zheng et al.| pfJ07| , M is loosely constrained, but 
consistent with M cu t for each of their SDSS samples. Our 
parameterisation thus allows for a physically motivated form 
for the satellite HOD, without adding an extra parameter 



SDSS Clustering Evolution 7 



into the model. The total mean occupation of halos at a 
given mass is therefore determined by entering Equations 



(201 and (21 1 into Equation (13 1 



The HOD model has four free parameters, but one can 
be removed by requiring that n g n calculated via Equation 
( 18 1 matches the observed number density of galaxies, n g r 
(which we take to be 0.0049 /i 3 Mpc -3 for each sample; see 
Thus, given a, Mi, and a cu t, we find the M au t that 
yields a match between n 9 T and n g n- For comparison pur- 
poses, we will wish to know the linear bias, bi, and the satel- 
lite fraction, f sa t, intrinsic to a given HOD. These can be 
expressed as 



bi = 



and 



fsat 



1 



AM B(M)n(M)N (M) (22) 



AMn{M)N c (M)N 3 {M). (23) 



4 CLUSTERING MEASUREMENTS AND 
BEST-FIT HODS 

Our measured angular auto-correlation functions, nor- 
malised using Equation \12\ , are plotted (error-bars) against 
the equivalent physical scale in the top panel of Figure 
[5] for each of our redshift slices (0.1 < 2 < 0.2 black, 
0.15 < 2 < 0.25 red, 0.2 < 2 < 0.3 blue, 0.25 < z < 0.35 
green, 0.3 < z < 0.4 magenta). We use the bias of best- 
fit HOD model for the 0.2 < z < 0.3 sample (1.246; see 
Table [5| as the b(zi) that enters Equation ( |10[ ) (and then 
Equation ( |12[ |), since this model is most consistent with the 
measurements at large angular scales (see Figure [6|. All of 
the measurements display a turnover at ~ 50/i _1 Kpc, sug- 
gesting a minimum physical scale below which we can not 
observe a pair of M r < —21.2 galaxies. Looking specifically 
at 0.3 < 2 < 0.4 sample, the measured amplitudes at small 
scales are significantly higher than any of the other samples, 
suggesting that the low completion and late-type fractions 
of this sample have indeed influenced the resulting measure- 
ments. 

The amplitudes of the measurements in Figure [5] clearly 
grow larger as the redshift increases. This is made most clear 
by looking at the bottom panel of Figure [5j where we plot 
the same information as the top panel, but divide the ampli- 
tudes by the power-law 0.15r^ ' 8 . If our sample of galaxies 
evolved passively, all of the measurements in Figure [5] would 
be consistent with each other (in both panels). This sug- 
gests that there is significant evolution in the properties of 
galaxies with M r < —21.2, and that this evolution causes 
lower redshift galaxies to exist in significantly less biased 
environments than their high-redshift counterparts. 

The top panel of Figure [6] plots (same colour scheme as 
Figure [5| displays our measured auto-correlation functions 
(without any normalisation) along with the best-fit model 
for each measurement, while the bottom panel displays the 
same information divided by the power-law O.O20 -0 ' 8 . By 
eye, the fits appear good, and the greatest disagreement ap- 
pears to be at large angular scales for each measurement. 
This is due, in part, to the fact that the error-bars are 
largest at these scales. For the higher redshift samples, the 
largest disagreement is around scales ~4/i _1 Mpc. This is 



0.1 <z<0.2 

0.15<z<0.25 

0.2<z<0.3 

0.25 <z< 0.35 
0.3<z<0.4 



I I 



in 1 , 
1 ■ 
it ' 



1 i'h'^i' V'n 1111 It 1 ! 1 ! 1 ! 1 ! 1 !! 



r eq (h-> Mpc) 

Figure 5. Top panel: the measured (error bars) angular auto- 
correlation functions for five photometric redshift slices 0.1 < 
z < 0.2 (black), 0.15 < z < 0.25 (red), 0.2 < z < 0.2 (blue), 
0.25 < z < 0.35 (green), 0.3 < 2 < 0.4 (magenta). The ampli- 
tudes have been normalized to account for changes in the red- 
shift distributions and passive evolution and are plotted against 
the equivalent physical scale. Bottom panel: the same informa- 
tion as the top panel, only the ui values have been divided by the 
power-law 0.15r<rq ' 8 . 



large enough that the result is predominantly dependent on 
the 2-halo term. This hints that our assumed cosmology may 
be off, but such considerations are beyond the scope of this 
paper. 



4.1 Best-fit HODs 

The best-fit HOD parameters for each redshift slice are pre- 
sented in Table [2] For each measurement, we fit the model 
between 0.004° and 1.0°. There are 24 measurements over 
this range and thus 21 degrees of freedom for each fit. For 
the two highest redshift ranges, the value of \ 2 l v is greater 
than 1 (but never greater than 2). We do not think that 
this suggests a problem, because the covariance only takes 
the statistical error of the correlation function measurements 
into account. These two samples have the most data, their 
statistical errors are thus the smallest. There are many other 
sources of potential error, such as the assumed cosmology 
and the fitting formulae utilised by the models. 

Our results are slightly different to those of R09, who 
used galaxies with < 2 < 0.3 photometrically selected 
from SDSS DR5 with M r - 51og(A) < -20.5. Their best- 
fit M cut and a cu t are similar to ours, but their best-fit 
a and Mi are significantly higher (our are a ~ 1.1 and 
logi O (Mi/i/M ) ~ 13.3 while R09 found a = 1.27 and 
logio (Mi h/M@) = 13.49). The main reason for this is that 
they used a different treatment for N sa t, and this treatment 
naturally results in best-fits with both larger a and Mi val- 
ues. Another consequence of this change in the modeling is 
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e(degrees) M (Ir'Mj 



Figure 6. Top panel: the measured (error bars) and model (solid 
lines) angular auto-correlation functions for five photometric red- 
shift slices 0.1 < 2 < 0.2 (black), 0.15 < z < 0.25 (red), 
0.2 < z < 0.2 (blue), 0.25 < z < 0.35 (green), 0.3 < z < 0.4 (ma- 
genta). Bottom panel: same information as the top, except that 
the amplitudes of u)(9) are divided by the power-law 0.02<9 — o s . 



that the resulting satellite fractions are higher than those of 
R09. 

Compared to Zheng et al. (20071, the best-fit values 



of our HOD parameters mainly fall between their results for 
SDSS galaxies with M r -5log(h) < -20 and M r -51og(/i) < 
—20.5. This makes sense given that our cutoff is at M r — 
51og(/i) < —20.43 and it is made effectively less luminous by 
the fact that photometric redshift errors allow some lower 
luminosity galaxies into our samples. Comparing our best- 
fit values of Mi and M cu t, we find that M\/M cu t varies 
between 12.2 and 16.6, but is less than 13.8 only for the 
0.3 < z < 0.4 sample. Theoretical predictions by | Zheng et| 
al. ( 2005 I found the ratio to be ~ 14 and ~ 18 using two 
separate galaxy formation models, and our results fall in 
between these. 

As expected from Figure [5] the bias values of the best- 
fit HOD models increase with redshift. The driving factor 
behind this change is the value of <j cu t, as when its best- 
fit value drops, the bias increases. This is due to the fact 
that a smaller value of a cut results in a sharper cutoff in the 
HOD profile, fewer galaxies in low-mass halos, and thus a 
larger value for the overall bias. The change in the bias is 
significantly larger than the change expected from passive 
evolution. Given a bi of 1.2 at z = 0.16, the bias of a pas- 
sively evolving sample is 1.22 at z = 0.34 — ~ 10% smaller 
than the bias value of our best-fit HOD. 

Figure [7] plots the best-fit HOD for each measurement 
(same colour scheme as Figure |5| . The largest differences 
are at the low mass end, where best-fit value of a cut affects 
the shape of the HOD by controlling the sharpness of the 
mass cutoff. The best-fit value of M cu t remains roughly con- 
stant (logio (M CU i h/Mo) is always between 12.05 and 12.16) 



Figure 7. The best-fit HOD models for the five photometric 
redshift slices 0.1 < z < 0.2 (black), 0.15 < z < 0.25 (red), 
0.2 < z < 0.2 (blue), 0.25 < z < 0.35 (green), 0.3 < z < 0.4 
(magenta). The inset plot displays the same data at high mass 
with (jV) scaled linearly. 



allowing the value of a cut to be the dominant factor in the 
shapes of the best-fit HODs. The trend we find in a cut is not 
completely unexpected, as the change in distance modulus 
is larger across lower redshift bins and thus a wider range of 
luminosities may contribute to the mass cut-off scale. One 
might be more comfortable with the results, however, if the 
change in a cu t were not so large. We investigate this further 
in pXl| 

At high mass, the HOD models look similar, though 
the high-redshift haloes host slightly more galaxies per halo. 
This change is due to the best-fit value of Mi, which we find 
to decrease with redshift for the four samples with z > 0.15. 
The decrease is rather dramatic between the 0.25 < z < 0.35 
and the 0.3 < z < 0.4 samples (logio(Mi H/Mq) decreases 
from 13.244 to 13.177), which causes the 0.3 < z < 0.4 
best-fit HOD to be significantly higher at the high mass end 
than any other sample. This also has a consequence for the 
satellite fraction corresponding to the best-fit HOD. For the 
four samples with z < 0.35, the satellite fraction stays within 
6% of 0.2 and there is no trend. These satellite fractions that 



we find are similar to those found by |Zheng et al. (20071 for 
SDSS galaxies with M r - 51og(ft) < -20.5. For the 0.3 < 
z < 0.4 sample, the satellite fraction jumps to 0.245. We 
discuss this further in §4.2| Across all of our samples, the 
best-fit value of a has no trend, and it is consistent enough 
that it affects no noticeable difference in the shape of the 
best-fit HODs. 



4-1.1 Fixed a cut 

The differences in the best-fit HOD for each redshift range 
are driven in large part by the changes in a cut- The best- 
fit values of a cu t generally decrease with redshift and thus 
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Table 2. The best-fit values of the HOD parameters (see Equations \20\ and j2l| )and la errors and the associated \ 2 values for the 
five redshift slices studied, fit between 0.004° and 1°. All masses are in units MqK~ . The parameters b\ and f sa t are the linear bias and 
the satellite fraction, given the best-fit HOD parameters and calculated using Equations p2t and (123b , respectively. 



Redshift Range 


a 


logio (M cut ) 


logio (Mi) 


a cut 


X 2 /dof 


6 


fsat 



0.1 < z 


< 0.2 


1 093+ 012 
i.uao_ 015 


12.162 


13.303±0.008 


04g -0.09 


10.8/21 


1.187 


0.203 


0.15 < 


z < 0.25 


1.103±0.008 


12.154 


13.319±0.005 


49-f - 06 

u -^ a -0.04 


17.4/21 


1.19 


0.190 


0.2 < z 


< 0.3 


1 100+ 008 
r.±uu_ 01 


12.089 


13.300±0.005 


0.33±0.04 


19.0/21 


1.231 


0.199 


0.25 < 


z < 0.35 


1.096±0.004 


12.051 


13.254±0.002 


0.23±0.02 


31.9/21 


1.286 


0.212 


0.3 < z 


< 0.4 


1.075±0.005 


12.064 


13.177±0.003 


0.2±0.03 


33.6/21 


1.347 


0.245 



Table 3. Same as Table|2| only the parameter a cu t is fixed at 0.3. All masses are in units M@h 



Redshift Range a logio (M C ut) logio (Mi) x 2 /dof b 1 f sat 



0.1 < z < 0.2 


1.13 


12.083 


13.326 


11.9/22 


1.205 


0.204 


0.15 < z < 0.25 


1.126 


12.075 


13.339 


23.1/22 


1.206 


0.190 


0.2 < 2 < 0.3 


1.10 


12.080 


13.30 


19.8/22 


1.233 


0.200 


0.25 < 2 < 0.35 


1.093 


12.070 


13.254 


34.9/22 


1.286 


0.209 


0.3 < 2 < 0.4 


1.078 


12.089 


13.175 


38.0/22 


1.346 


0.243 




Figure 8. Same as Figure]?] only for a c 



: 0.3. 



cause a sharper mass cutoff at higher redshift. To investigate 
whether our results are potentially biased by the changes in 
this parameter (which may or may not be physical) we there- 
fore fix (Tout = 0.3 and re-calculate the best-fit HODs for 
each redshift slice, with these best-fit parameters presented 
in Table [3] The increase in the best-fit \ 2 value is greater 
than la only for the 0.15 < z < 0.25 and 0.3 < z < 0.4 
slices. The increase in the x 2 values for the lower redshift 
samples is due to the fact that the lower a cu t value forces 
the overall bias to be larger and thus less consistent with 
our measurements at large angular scales. 

Figure [5] displays the best-fit HODs for a cut = 0.3. 



They look extremely similar to each other — only the 
0.3 < z < 0.4 best-fit HOD is distinguishable from the other 
curves. This suggests that the spread in best-fit HODs that 
we obtain when we leave a cu t as a free parameter is caused 
by the uncertainty inherent in our best-fit HODs, i.e., there 
is some degeneracy between a cut and M cut . This also illus- 
trates the fact that having the same HOD at two differ- 
ent redshifts actually implies quite different clustering. The 
0.1 < z < 0.2 and 0.25 < z < 0.35 best-fit HODs are nearly 
identical, yet the bias is 10% higher for the 0.25 < z < 0.35 
best-fit HOD. We discuss the implications of this further in 

m 



4.2 Splitting by Type 

As in R09, we can gain insight by splitting our sample by 
type value and measuring the respective auto-correlation 
functions. This allows us to determine if the trends we ob- 
serve are driven by a single galaxy type. In this work, we 
refer to galaxies with t > 0.1 as late-type and those with 
t < 0.1 as early- type. This split is motivated by the fact that, 
at low-redshift, it yields similar distributions as a u — r = 2.2 



split in colour (as suggested bylStrateva et al. 2001 the type- 



value split should perform better as a function of redshift 
than this simple colour cut). We fit HOD models to the 
early- and late- type oj(8) measurements by assuming that 
the fraction of late-type galaxies that are central galaxies, 
f c , exhibits a decrease with mass parameterised by 



fc(M) = f c0 exp 



-logio (M/Mcut 



(24) 



and the fraction of satellite galaxies that are late-type, f a , 
exhibits a decrease with mass parameterised by 



fs(M) = f s0 exp 



-logio (M/Mp) 



(25) 



10 A. J. Ross, W. J. Percival, & R. J. Brunner 



10 



0.1 



0.1 



0.01 




100 



10 



7-. 



0.1 



0.01 



0.001 



10 



0.1 



0.01 



0.001 



i i i 1 1 1 1 




\ \\ \ i 1 1 m il l 




1Q 11 10 12 1Q 13 1Q 14 1015 

M (h- 1 M e ) 



Figure 9. Left panels: the measured (error bars) angular 2-point correlation functions for five photometric redshift slices 0.1 < z < 0.2 
(black), 0.15 < z < 0.25 (red), 0.2 < z < 0.2 (blue), 0.25 < z < 0.35 (green), 0.3 < z < 0.4 (magenta) for early-type (top) and late-type 
galaxies (bottom), compared against the best-fit model u)(9). Right panels: the best-fit HOD models for the early- (top) and late-type 
galaxies (bottom; same colour scheme as left-hand panels) 



As in R09, we find f c o for every combination of f s o, a cen , 
and a sat by requiring that the overall fraction of late-type 
galaxies matched the observed fraction of late-type galaxies. 

We also employ the 'minimal mixing' modeling con- 
straints described by Equations 20-23 of R09 in order to 
calculate the model ui{9) of early- and late-type galaxies. 
These equations place the constraint that early- and late- 
type galaxies occupy separate halos, as much as the over- 
all statistics allow, onto the calculation of the model uj(6). 



Generally, such a model does not allow galaxies of differ- 
ent type to exist in the same low-mass halos but will allow 
late-type galaxies to exist as satellites, around early-type 
central galaxies, in more massive halos. This is opposed to 
a 'full-mixing' model, which places no constraints and thus 
assumes no environmental dependence other than mass. 

The measured u(9) for early- (top) and late- type (bot- 
tom) galaxies are displayed in the left-hand panels of Figure 
pi plotted against the best-fit HOD model (solid lines; same 
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Table 4. The best-fit values of the HOD parameters and the associated \ 2 values for the early- and late-type samples studied. 
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Sample f c0 f s0 cr cen a aat X 2 /dof b lilate 

"l , early 



0.1 < z 


< 0.2 


0.42 


0.33±0.01 


0.47±0.03 


4.1±0.6 


61/44 


1.02 


1.28 


0.15 < 


z < 0.25 


0.36 


0.34±0.01 


0.50±0.04 


2.7±0.5 


79/44 


1.02 


1.28 


0.2 < z 


< 0.3 


0.57 


0.31±0.01 


0.39±0.03 


3.2±0.4 


60/44 


1.06 


1.32 


0.25 < 


z < 0.35 


0.88 


0.42±0.01 


0.34±0.03 


3.8±0.4 


109/44 


1.13 


1.39 


0.3 < z 


< 0.4 


0.93 


0.34±0.01 


22+ 04 
u - z -0.03 


3.6±0.3 


70/44 


1.16 


1.43 



colour scheme as Figure [5j. The general results are con- 
sistent with previous results (e.g., |Zehavi et al.|[2005] R09); 
the early-type galaxies have larger amplitudes than the late- 
types, and the shape of lj(8) at intermediate angular scales 
(between about 0.01° and 0.1°) is concave for the early-type 
galaxies and convex for the late- type galaxies. 

Interestingly, the differences between the 0.25 < z < 
0.35 and 0.3 < z < 0.4 measurements are not dramatic 
when the galaxies are split by type (unlike for the full sam- 
ple. When split by type, the bias values of the best-fit models 
for the galaxies in the 0.3 < z < 0.4 slice are only moderately 
larger than that of the 0.25 < z < 0.35 slice. This suggests 
that the differences in the full sample are driven by the fact 
that fi a te decreases dramatically in the 0.3 < z < 0.4 sam- 
ple, causing the small-scale amplitudes to be dramatically 
larger (and a high satellite fraction found for the best-fit 
HOD) and the overall bias to be larger. We may therefore 
wish to consider only the galaxies with z < 0.35 when dis- 
cussing the overall implications of the evolution we observe. 

The goodness of fit is not ideal for any of the samples, as 
the x 2 /DOF presented in Table [4] are significantly greater 
than 1 in each case. We note, however, that in each case the 
minimum \ 2 are f ar smaller than what is achievable with 
a model that allows mixing. The models have difficulty re- 
producing the shape of the late-type measurements around 
where they exhibit an increase in slope (~ 0.02°). In each 
case, the best-fit model increases in slope at a larger angular 
scale than the measurement does. This suggests imperfec- 
tions in model. We do not attempt to improve the model, 
but we do note that the assumption that late-type galax- 
ies are distributed in halos like an NFW profile is probably 
incorrect (as one would infer from the morphology-density 
relationship, see, e.g., Dressler|1980 1. 

The shapes of the best-fit HODs of the early- (top) and 
late-type (bottom) galaxies are displayed in the right-hand 
panels of Figure [9] The f c o, fso, (Teen, and a sa t parameters 
that define these fits are are presented in Tabel|4] The mass 
cutoff profiles show similar behavior as the best-fit HODs of 
the full samples — the cut-off grows increasingly sharp with 
redshift for both the early- and late-type best-fit HODs. The 
shapes of the late-type HODs show significant differences at 
around 10 12 fe _1 Mq, where each HOD shows a local maxi- 
mum. Closer inspection reveals that the values of a cut , er cen , 
and f c o are strongly correlated — a smaller a cu t results in 
a smaller a cen , a larger / c o, and a sharper peak at the local 
maximum. We thus do not attribute any special significance 
to this behaviour. The evolution we find in the early-/late- 
type HODs appears to be driven primarily by the evolution 
of the best-fit HODs of the full samples. 



At larger scales, where the 2-halo term dominates, there 
is good agreement between the models and the measure- 
ments, and we can therefore trust the bias of each best-fit 
model (presented in Table |4j. The bias increases by ~ 14% 
and ~ 12% for the late- and early-type galaxies, respec- 
tively. This increase in bias is significantly greater than the 
few percent change one would expect of a passively evolving 
sample (for b — 1.43 at z = 0.34, the passively evolved sam- 
ple would have b = 1.39 at 2 = 0.16; for b = 1.16 at z = 0.34, 
the passively evolved bias would be 1.15 at z = 0.16). 

The fact that both the early- and the late-type galaxies 
display significant increases in bias over that of a passively 
evolving sample means that we cannot attribute the evolu- 
tion in bias that we observe in the full sample to galaxies of a 
certain type. Either the average halo bias of galaxies of both 
type is decreasing, or the contamination between our sam- 
ples is large enough (due to, e.g., edge-on late-type galaxies 
reddened by dust lanes; Masters et al. 20101 to cause the 



bias of both samples to decrease. The main conclusions we 
can draw from our measurements of the correlation func- 
tions of early- and late-type galaxies are that the minimum 
mixing model continues to be favoured over one that allows 
uninhibited mixing and that the inconsistencies we found in 
the clustering of the 0.3 < z < 0.4 galaxies are due primarily 
to this sample's relatively low late-type fraction. 



5 CROSS-CORRELATIONS AS A 
SYSTEMATIC TEST 

One issue with the potential to cause systematic errors in 
the interpretation of our measurements is our estimation of 
the redshift distribution of each of our galaxy samples. If our 
distributions were systematically affected such that we over- 
predict the width of the distributions, we would incorrectly 
interpret our clustering measurements as having a higher 
bias. If, for example, the magnitude of this problem grew 
with redshift, it would cause us to (incorrectly) determine 
that the bias was growing larger with redshift. 

One way to test the accuracies of our redshift distri- 
butions is to perform an angular cross-correlation measure- 
ment, uj x {9), between redshift bins. We therefore measure 
three lo x {8): between 0.1 < z < 0.2 and 0.2 < 2 < 0.3; 
0.15 < 2 < 0.25 and 0.25 < 2 < 0.35; and 0.2 < 2 < 0.3 and 
0.3 < 2 < 0.4. In each of these cases, any cross-correlation 
signal is due to the fact that, while there is no overlap in 
the photometric redshift, the errors on the photometric red- 
shifts imply that the true redshift distributions overlap (as 
can be seen clearly in Figure [3}. One can define a factor W x 
as 
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Figure 10. The measured (error bars) angular cross-correlation 
functions between galaxies in the 0.1 < z < 0.2 and 0.2 < z < 
0.3 redshift slices (bottom), the 0.15 < z < 0.25 and 0.25 < 
z < 0.35 redshift slices (middle), and the 0.2 < z < 0.3 and 
0.3 < z < 0.4 redshift slices (top), all with amplitudes divided 
by the power-law O.OO40 -08 . In each panel, the measured auto- 
correlation, multiplied by W x /W and divided by the same power- 
law, of galaxies from the intervening bin (0.15 < z < 0.25 bottom, 
0.2 < z < 0.3 middle, 0.25 < z < 0.35 top) is displayed with a 
solid black line. In the bottom panel, the dashed line displays the 
measured auto-correlation of late-type galaxies with 0.15 < z < 
0.25, multiplied by W x /W, again divided by the power law. 



W x = 1/c / dz H(z)dN/dz 1 dN/dz 2 , 



(26) 



where we have simply replaced (dN/dz) 2 from Equation ^ 
with the multiple of the two redshift distributions of the 
galaxies that are being cross-correlated. 

Similarly to W, the W x factor quantifies how much of 
the underlying clustering signal we should observe. We can 
expect that this underlying clustering signal of these cross- 
correlations should be similar to the real-space clustering 
of the galaxies in the intervening bin (i.e., for the cross- 
correlation between the 0.1 < z < 0.2 and 0.2 < z < 0.3 
redshift slices, the real-space galaxy clustering signal should 
be similar to that of the 0.15 < z < 0.25 sample). Therefore, 
we can expect, based on our redshift distributions, to mea- 
sure a cross-correlation signal with amplitude W x /WiU)i(6), 
where Wi and u>i(9) are calculated using the redshift distri- 
bution of the intervening redshift slice. If we do not measure 
such a signal, it implies that our redshift distributions may 
be estimated incorrectly. 

The three panels of Figure [To] display the three cross- 
correlations (error-bars) we measure compared to the mea- 
sured auto-correlation multiplied by W x /W (solid lines) of 
the galaxies in each of the respective intervening bins. The 
bottom panel displays the cross-correlation of the 0.1 < 
z < 0.2 and 0.2 < z < 0.3 redshift slices (black error- 
bars), compared to the measured auto-correlation of the 



0.15 < z < 0.25 redshift slice multiplied by W/W x (solid 
black line). All of the data displayed is divided by the power- 
law O.OO40 -0 ' 8 , for clarity. At large scales, where the 2- 
halo term dominates, the measurements are consistent. The 
error-bars on the cross-correlation are quite large. This is 
due to the fact that the overlap between the two redshift 
slices is small (W x /W is 0.278). 

The overlap between the 0.1 < z < 0.2 and 0.2 < z < 
0.3 redshift distributions is mainly at the high-redshift tail 
of the 0.1 < z < 0.2 slice and the low-redshift tail of the 
0.2 < 2 < 0.3 slice (see Figure |3j. This suggests that much 
of the cross-correlation signal is due to pairs of relatively 
high luminosity late-type galaxies (from the 0.1 < z < 0.2 
slice) and relatively low luminosity late-type galaxies (from 
the 0.2 < z < 0.3 slice), since late-type galaxies have the 
highest redshift errors and are thus most likely to occupy 
the tails of the redshift distribution. The dashed line in 
the bottom panel of Figure [10] displays the measured au- 
tocorrelation of late-type galaxies with 0.15 < z < 0.25, 
multiplied by W x /W. Its values are more consistent with 
the measured cross-correlation, suggesting that late-type 
galaxies do indeed dominate the clustering signal of cross- 
correlation. The late-type amplitudes are smaller than that 
of the cross-correlation, suggesting that while the late-type 
galaxies dominate measurement, there is still some contri- 
bution from early-type galaxies (as one would expect since 
they are included in the measurement). We therefore be- 
lieve this cross-correlation measurement is consistent with 
the redshift distributions we estimate for the 0.1 < z < 0.2, 
0.15 < z < 0.25, and 0.2 < z < 0.3 slices 

The middle panel of Figure [To] displays the cross- 
correlation between the 0.15 < z < 0.25 and 0.25 < z < 0.35 
redshift slices (black error-bars) compared to the auto- 
correlation of the 0.2 < z < 0.3 redshift slice multiplied 
by W x /W (solid black line). At large scales, the two mea- 
surements agree extremely well, and we once again mea- 
sure smaller amplitudes for the cross-correlation at small 
scales, but to lessor degree than in the bottom panel. This 
makes sense given our previous explanation, as the over- 
lap between these redshift slices is greater (W x /W has in- 
creased to 0.327) and the resulting cross-correlation is not 
dominated to the same degree by galaxies that are likely to 
be found in the tails of the redshift distribution. Thus, once 
more we find the cross-correlation measurement to be con- 
sistent with our estimation of the redshift distributions, in 
this case those for the 0.15 < z < 0.25, 0.2 < z < 0.3, and 
0.25 < z < 0.35 slices. 

The top panel of Figure[l0]displays the cross-correlation 
between the 0.2 < z < 0.3 and 0.3 < z < 0.4 redshift 
slices (black error-bars) compared to the auto-correlation 
of the 0.25 < 2 < 0.35 redshift slice multiplied by W x /W 
(solid black line). In this case, the measurements do not 
agree as well at large scales, but do agree better at small 
scales. The measured cross-correlation is larger than we 
would expect at large scales. This suggests two possibili- 
ties; either the bias of galaxies that contribute to the cross- 
correlation measurement are significantly higher than those 
in the 0.25 < z < 0.35 redshift slice, or W x is in truth larger 
than we calculate because we have incorrectly estimated the 
redshift distributions. The simplest explanation is that the 
redshift distribution of the 0.3 < z < 0.4 is wider than we 
have estimated. This would imply only that the bias for this 
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redshift slice has been under-estimated, and it would only 
strengthen our findings that the bias of galaxies increases 
with redshift beyond what one would expect from passive 
evolution. 

Our cross-correlation measurements suggest that our es- 
timation of the redshift distributions has not introduced a 
systematic error that could be responsible for the trend we 
find with bias. We are, therefore, confident that the trend we 
find — that with increasing redshift the linear bias of galax- 
ies with M r < —21.2 increases significantly beyond that of 
a passively evolving sample of galaxies — is real. 



6 PHYSICAL INTERPRETATION 

We find that the bias of galaxies with M r < —21.2 increases 
with redshift significantly beyond the increase in bias one ex- 
pects of a passively evolving system. This implies that the 
galaxies themselves must be evolving under the influence 
of physical interactions, such as mergers, star formation, or 
cannibalisation of satellites, etc.. We can rule out the passive 
effect of ageing stellar populations as a cause for our observa- 
tions. This effect would cause the galaxies to dim and there- 
fore suggests that galaxies at z ~ 0.3 should be compared 
to less luminous galaxies at z ~ 0.1. However, if we were 
to include less luminous galaxies in our 0.1 < z < 0.2 red- 
shift slice, we would measure an even lower bias. We there- 
fore know that attempting to account for such evolution, as 
measurements of the luminosity function (e.g., IBlanton etl 



al. 2003[ ) suggest is present, would only enhance the trend 
we observe. 

If we had cut at approximately L* for each sample 
(which would have meant a difference of ~ 0.3 in M r over 
our full redshift range based on the evolutionary factors de- 
termined by Blanton et al.|[2003 l, the increase in bias with 
redshift that we observed would have been even stronger. 
This implies that our M cu t values likely would have been 



smaller at low redshift than at high. Zheng et al. (20071 



found that galaxies from DEEP2 had a higher M cu t than 
those selected from SDSS with similar L/L* values, which 
is thus in agreement with our results. Using a fixed luminos- 
ity cut, however, allows us to obtain a fundamental result — 
galaxies of the same luminosity reside in halos of approxi- 
mately the same mass, independent of any change in redshift 
between 0.1 and 0.4. 



a passively evolving population, as a weighted prediction for 
the evolution of the bias of each sample. 

We adopt the following procedure: First, we calculate 
B(M) for each halo mass of the given best-fit HOD (which 
is evaluated at its own respective median redshift). We then 
take this bias and use Equation Q to obtain the passively 
evolved z = bias of the halo, which we denote b . Com- 
paring our best-fit HODs as a function of this statistic re- 
moves the passive effect of large-scale evolution, leaving the 
effects of mergers and galaxy formation. The effect of merg- 
ers should be small as relative effects on the bias of galaxies 
leaving a sample, and those joining, should cancel to a large 
degree. Thus, for each HOD, we obtain (N) as a function of 
b , giving us HOD results at different redshifts that we can 
compare, having accounted for halo growth in the mean. 

we plot (with the same colour scheme as 



In Figure 



11 



Figure |5| the mean occupation of galaxies versus b . The 
curves at the high-bias end are extremely similar, and there 
is no trend with redshift. The major difference between each 
curve is the value of the bias at which the HOD shows sig- 
nificant decline. This suggests that the difference between 
our samples is due almost entirely to evolution in the min- 
imum halo bias required for a halo to host a galaxy with 
M r < -21.2. 

Figure [12] plots the best-fit HODs against b a for a cut = 
0.3. All five curves lie nearly on top of each other at b a — 
1.1. Going to lower b values, they separate such that the 
cut-off b a decreases with redshift, just like as in Figure [TT) 
Thus, both figures suggest that at low redshift, galaxies with 
M r < —21.2 exist in halos in which they did not exist in at 
higher redshift, i.e., ~ L* galaxies are being created (due 
to, e.g., galaxies merging, accreting satellite galaxies in low- 
mass halos, or undergoing a burst of star formation) in halos 
with masses around our nearly constant value of M cu t ~ 
1.2 x 10 12 h~ 1 M e . 

At large b , both Figures [TT] and [T2| show all of the best- 
fit HODs to exhibit the extremely similar behaviour, as for 
both the maximum difference is only ~ 10% and there is no 
trend with redshift (the highest and lowest redshift slices are 
nearly identical, as are the three samples in the middle). Our 
measurements thus require little evolution in the occupation 
of halos with bias greater than 1.2, as our nearly constant 
satellite fractions suggest. Thus, even as they grow in mass, 
halos with b a > 1.2 do not see a significant increase in the 
number of galaxies that occupy them. 



6.1 HOD versus z=0 Bias 

Our best-fit HOD models imply that the HOD evolves very 
little with redshift, but this lack of evolution actually implies 
significant evolution in the large-scale bias of the galaxies. In 
order to remove the effect of halo mass growth between red- 
shift slices, we can match HODs by determining the present 
day bias of passively evolved dark matter halos. Evolution 
in bias is not deterministic, but is statistical in nature: the 
mass of each halo will follow an evolutionary track with 
a strong stochastic element, that is independent from the 
large-scale clustering. When analysing a sample of halos at 
different redshifts, it is therefore difficult to try to disen- 
tangle the effects of structure growth with the growth of 
galaxies. However, we can use the known bias evolution for 



6.2 Robustness Against Changes in Covariance 
Matrices and Cosmology 

Our physical interpretation is based on results determined 
from data with covariance matrices estimated with a jack- 
knife method and models using one specific set of cosmolog- 
ical parameters. One may therefore worry that our results 
may not be robust to changes in the way we estimate co- 
variance matrices or the cosmology we assume in the mod- 
els. To learn about the degree to which our results depend 
on the specific form of the covariance matrices we use, we 
assume the same percentage error on all measurements and 
re-determine the best-fit HODs for the 0.1 < z < 0.2 and 
0.25 < 2 < 0.35 samples. Fixing a cu t = 0.3, we find a 
changes to 1.12 for the 0.1 < z < 0.2 sample and 1.104 for 
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Figure 11. The best-fit HOD models for the five photometric 
redshift slices, plotted against the passively evolved present day 
value of the halo bias. The inset plot displays the same data at 
high b with (N) scaled linearly. The colour scheme is the same as 
for previous plots( 0.1 < z < 0.2 (black), 0.15 < z < 0.25 (red), 
0.2 < z < 0.2 (blue), 0.25 < z < 0.35 (green), 0.3 < z < 0.4 
(magenta)). 
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Figure 13. The best-fit HODs for 0.1 < z < 0.2 (black) and 
0.25 < z < 35, with cr cu t fixed at 0.3, and plotted against the 
z = passively evolved halo bias, b . The solid lines display the 
original best-fit (same data as Figure [l2| l, the dotted line displays 
the best-fits obtained when the error on the u>{9) is assumed to 
be the same percentage in all angular bins, and the dashed line 
displays the result for a flat universe with f2 m = 0.25 and Q;, = 
0.04 (and using the original covariance matrices). 



0.1 <z<0.2 




Figure 12. Same as Figure [TT] only for a cu t = 0.3. 

the 0.25 < z < 0.35 sample, while logio(Mift/Mo) changes 
to 13.330 and 13.258, respectively. 

The changes in both parameters are minor, as can be 
seen in Figure |13| This figure displays the best-fit HODs, 
determined using a fixed percentage error, plotted with dot- 
ted lines, and the original data is plotted with solid lines 



(black for 0.1 < z < 0.2 and green for 0.25 < z < 0.35 in 
both cases), against b . There are no significant differences 
(the dotted lines are barely identifiable); clearly treating the 
error in this manner would have no effect on our physical 
interpretation. This suggests that our physical interpreta- 
tion is robust to any reasonable change in the treatment of 
our error-bars/covariance matrices, given that the results of 
Norberg et al.| ( [2009[ ) suggest jack-knife covariance matrices 
should perform far better than simply assuming a constant 
percentage error. 

We also test our results for a different set of cosmolog- 
ical parameters. We still assume a flat universe, but change 
Q m to 0.25 and f2(, to 0.04. This change does have a signifi- 
cant effect on the HOD parameters — a changes to 1.115 for 
the 0.1 < z < 0.2 sample and 1.072 for the 0.25 < z < 0.35 
sample, while logio(Mi/i/M ) changes to 13.240 and 13.172, 
respectively. The resulting HODs are plotted in Figure |13| 
against b Q with dashed lines (black for 0.1 < z < 0.2 and 
green for 0.25 < z < 0.35). While the changes in the HOD 
parameters are significant, they cause only small changes 
in the HODs when plotted against b . For 0.1 < z < 0.2, 
the new cosmology causes a slight shift to larger b values 
and for 0.25 < z < 0.35 the two HODs are barely distin- 
guishable. The lack of change in the best-fit HODs when 
plotted against 6 is due to the fact that the change in the 
cosmological parameters changes the form of B(M) — and 
thus most of the change in the best-fit HOD parameters 
simply reflects the change in B(M). Therefore, the changes 
when the HODs are plotted against b are minor. It appears 
clear that reasonable changes in the cosmology we assume 
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would not cause any significant change in the physical inter- 
pretation of our measurements, and our conclusions should 
therefore be robust. 



6.3 Evolution of Occupation Number 

Another way to look at our results is to plot the change in 
occupation number divided by the change in mass as a func- 
tion of b , ATV/AAf (b ). Figure 14 displays this information, 



in units of 10~ HMq , against the average halo mass for 
constant b values (vertical lines denote b = 0.9, 1.0, 1.5, 
and 3.0 for reference). The four curves display the change 
between the 0.3 < z < 0.4 and 0.1 < z < 0.2 best-fit HODs 
(solid black line), the 0.25 < z < 0.35 and 0.1 < z < 0.2 
best-fit HODs (dotted red line), the 0.25 < z < 0.35 and 
0.1 < z < 0.2 best-fit HODs with a cut fixed at 0.3 (dashed 
blue line), the 0.25 < z < 0.35 and 0.1 < z < 0.2 best- 
fit HODs with a cut fixed at 0.3 after changing the assumed 
(flat) cosmology to f2 m = 0.25 and fit = 0.04 (long-dashed 
green line). 

All four curves in Figure [14] display a strong peak at 
b a ~ 0.9, and this corresponds to an average halo mass of 
~ 10 12 A _1 Af Q . Halos with b Q = 0.9 gain ~ 7.5 x 10 u /i _1 Af s 
mass between z = 0.34 and z = 0.16. Thus, given AN/ AM 



7 x !Q- LS hM, 







this accreted mass allows the cre- 
ation of an average of ~ 0.5 galaxies per halo. We have 
thus presented evidence for the following scenario: the bias 
of M r < —21.2 galaxies decreases as redshift decreases be- 
cause these galaxies form preferentially in halos with masses 
~ 1O 12 /i _1 M0 (independent of redshift) and the bias of 
these halos naturally decreases as the Universe evolves. 

At the high b end of Figure |14| our results are incon- 
clusive. Looking at the evolution between 0.1 < z < 0.2 
and 0.3 < z < 0.4, our results suggest that highly biased 
halos are actually losing galaxies, which would imply that 
the number of galaxies that leave the sample, either through 
mergers or dimming, is greater than the number of galaxies 
the halos accrete as they accrete mass. Our measurements 
from the 0.3 < z < 0.4 slice may not be reliable, however, 
so a comparison between 0.1 < z < 0.2 and 0.25 < z < 0.35 
(red dotted line) may be more safe. This comparison sug- 
gests that the increase in number slowly decreases, but the 
same comparison for the best-fit models with a cut =0.3 sug- 
gest a slow increase in the number with bias. In both cases, 
however, the maximum increase for halos with Mq > 10 13 
is ~ 0.5 x 10~ 13 hM^ 1 . Since all of our best-fit HODs have 
(N) > 1 for halo of mass 10 13 /i _1 Mq, this gain in occupa- 
tion number does suggest that there is a net loss of galaxies 
in high mass halos, as would occur if galaxies merge after 
entering the halo. 

Figure [14] suggests a required mass threshold is ~ 
10 12 /i _1 M Q for M r < -21.2 galaxies. This result is in basic 



agreement with numerical models (e.g., Shankar et al. 2006 1, 
which show that the fraction of baryonic mass converted into 
stars peaks at ~ 10 12 /i _1 Mq and also the recent results of 



Guo et al. (20091, which suggest galaxy formation efficiency 
peaks at a halo mass slightly lower than 10 12 /i _1 Mq. This 
implies the following scenario: the ability of a halo to host a 
galaxy with M r < —21.2 is tied to its ability to convert its 
baryonic matter into a sufficient number of stars coalesced 
into a single galaxy, the efficiency with which a halo can do 
this is related to its mass, and this implies that the bias of 




Figure 14. The change in occupation number, divided by the 
change in halo mass, in units of 10~ 







versus the average 
halo mass at constant passively evolved z = bias of the halo. 
The solid black line displays result comparing the 0.1 < z < 0.2 
and 0.3 < z < 0.4 best-fit HODs, the dotted red line displays 
the result when comparing the 0.1 < z < 0.2 and 0.25 < z < 
0.35 best-fit HODs, the dashed blue line displays the result when 
comparing the 0.1 < z < 0.2 and 0.25 < z < 0.35 best-fit HODs 
when cr cu t is fixed at 0.3, and the long dashed green line displays 
the result when comparing the 0.1 < z < 0.2 and 0.25 < z < 0.35 
best-fit HODs when cr C ut is fixed at 0.3 and the cosmological 
model is changed to a flat universe with Q m = 0.25 and H;, = 0.04. 
The vertical lines denote selected b values. 



the halos in which galaxies form decreases as the Universe 
evolves. We therefore find a peak in the AN/ AM versus b 
relationship at the b value that corresponds to having the 
most halos cross the ~ 10 12 /i _1 M(7) mass threshold. 



7 CONCLUSION 

We have calculated the angular auto-correlation functions 
of SDSS DR7 galaxies with M r < —21.2 in five overlapping 
photometric redshift slices with 0.1 < z < 0.2, 0.15 < z < 
0.25, 0.2 < z < 0.3, 0.25 < z < 0.35, and 0.3 < z < 0.4 
and we found best-fit HODs for each sample by applying 
the halo model. The most relevant results are: 

• The bias increases with redshift and the increase is far 
greater than one would expect of a passively evolving sample 
of galaxies. 

• When we split our sample by galaxy type into early- and 
late-type samples, we find the increase in bias is similar for 
both samples. We also find that the clustering of early- and 
late-type galaxies is better fit by a minimal mixing model 
(as presented in R09) than one that allows galaxies to mix 
freely within halos, though the high v 2 values of our best-fit 
results suggest that the model needs to be further refined. 

• The best-fit HODs of our full sample suggest that the 
mass cut-off remains nearly constant with redshift (espe- 
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cially for fixed (J cut)- Since halos grow in mass as the Uni- 
verse evolves, one would expect that for a passively evolving 
sample, the mass cut-off would increase as the redshift gets 
smaller. Interpreted in terms of the z = halo bias, b a , 
this constant mass cut-off implies smaller cut-off value of 6 
at lower redshift, and this allows the lower redshift galax- 
ies to have a lower overall bias. This implies that galaxies 
with M r < —21.2 are forming (via mergers, delayed star 
formation, accretion of dwarf galaxies, etc.) in increasingly 
less-biased halos as the Universe evolves. 

• Comparing the change in occupation number versus the 
change in mass for halos with constant b a , we find a strong 
peak at b ~ 0.9. This bias value corresponds to an av- 
erage halo mass of 10 12 /i _1 Mq, suggesting that galaxies 
with M r ~ —21.2 form preferentially in halos of mass 
10 12 h- 1 M e . 

• Our results are consistent with previous measurements 



made testing the evolution of the HOD (e.g., Zheng et al. 
20071 and numerical models (e.g., Shankar et al. 2006 1 which 



predict maximum star formation efficiencies for in halos of 
mass 10 12 h -1 M Q . 

• Our results are robust against changes in our treatment 
of the error- bars/covariance and changes in the underlying 
cosmology. It would be ideal to confirm our results via sim- 
ulations or semi-analytic galaxy formation models, but we 
leave such investigations for future study. 

Future surveys will be able to provide significant follow- 
up to these results. DES will allow our finding — that 
M r < —21.2 galaxies form of constant mass, independent of 
redshift between 0.1 < z < 0.4 — to be tested over a much 
wider range of redshifts and luminosities. If our results prove 
robust, we will be able to determine a fundamental relation- 
ship between the luminosity of a galaxy and the mass of the 
halo in which it forms. 
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